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Abstract 

We study two types of problems related with scheduling on a machine of varying speed. 
In a static model, the speed function is (through an oracle) part of the input and we ask for 
a cost-efficient scheduling solution. In a dynamic model, deciding upon the speed is part of 
the scheduling problem and we are interested in the tradeoff between scheduling cost and 
speed-scaling cost, that is energy consumption. Such problems are relevant in production 
planning, project management, and in power-management of modern microprocessors. 

We consider scheduling to minimize the total weighted completion time. As our main 
result, we present a PTAS for the static and the dynamic problem of scheduling on a single 
machine of varying speed. As a key to our results, we re-interprete our problem within 
the folkloric two-dimensional Gantt chart: instead of the standard approach of scheduling 
in the time-dimension, we construct scheduling solutions in the weight-dimension. We 
also give complexity results, more efficient algorithms for special cases, and a simple (2 -|- 
e)-approximation for preemptive dynamic speed-scaling with release dates. Our results 
also apply to the closely related problem of scheduling to minimize generalized global cost 
functions. 

Key words: scheduling, speed-scaling, power-management, generalized cost functions, non- 
availability periods, energy-aware 

1 Introduction 

In several computation and production environments we face scheduling problems in which the 
speed of resources may vary. We distinguish mainly two types of varying speed scenarios: one, in 
which the speed is static and cannot be influenced, and another dynamic setting in which deciding 
upon the processor speed is part of the scheduling problem. The static setting occurs, e.g., in 
production environments where the speed of a resource may change due to overloading, aging, or 
in an extreme case it may be completely unavailable due to maintenance or failure. The dynamic 
setting finds application particularly in modern computer architectures, where speed-scaling is an 
important tool for power-management. Here we are interested in the tradeoff between the power 
consumption and the quality-of-services. Both research directions — scheduling on a machine 
with given speed fluctuation as well as scheduling including speed-scaling — have been pursued 
quite extensively, but seemingly separately from each other. 

In this paper, we consider scheduling to minimize a standard measure for quality-of-service, 
the sum of weighted completion times ^jWjCj, on a machine of varying speed. For both 
settings, static and dynamic speed-scaling, we derive structural results, complexity results, and 
best possible (fully) polynomial time approximation schemes. Very useful in our arguments is the 
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geometric view of the min-sum scheduling problem in a two-dimensional Gantt chart. Crucial to 
our results is the deviation from the standard view of scheduling as a decision making process in 
the time dimension and switching to scheduling in the weight dimension, instead. This dual view 
allows us to cope with the highly sensitive speed changes in the time dimension which prohibit 
standard rounding, guessing, and approximation techniques. 

Previous work 

Research on scheduling on a machine of given varying speed has been mainly focused on the spe- 
cial case of scheduling with non-availability periods, see e.g. PI ITFIITTII^ . For min J2 "^j i ovAy 
recently the first constant approximation was derived in |10] . In fact, their (4-|-e)-approximation 
computes a universal sequence which has the same guarantee for any (unknown) speed function. 
If the speed is only increasing, there is known an efficient PTAS ''2V . In this case the complexity 
remains an open question, whereas for general speed functions the problem is strongly NP-hard, 
even when for each job the weight and processing time are equal [21] • 

The problem of scheduling on a machine of varying speed is equivalent to scheduling on 
an ideal machine (of constant speed) but minimizing a more general cost function, ^ Wjf{C'j), 
where / is a nondecreasing function. This identification is done via a straight forward transfor- 
mation, where /(C) denotes the time that the varying speed machine needs to process a work 
volume of C |12| . In particular, the special case of only nondecreasing (nonincreasing) speed 
functions corresponds to concave (convex) global cost functions. Recently, in [H] were given 
tight guarantees for the well-known Smith rule for all convex and all concave functions /. In 
this work is also shown that the problem for increasing piecewise linear cost functions is strongly 
NP-hard even for functions with only two slopes, and so is our problem when the machine speed 
takes only two distinct values. 

Even more general min-sum cost functions have been studied, where each job may have its 
individual nondecreasing cost function. A {2 + e)-approximation was recently derived in [5]. 
For the more complex setting with release dates there is a randomized O(loglog(nmaxj p-,))- 
approximation known j4] . Clearly, these results translate also to the setting with varying machine 
speed. 

Scheduling with dynamic speed-scaling was initiated in |23j and became a very active research 
field in the past fifteen years. Most work focuses on scheduling problems where jobs have 
deadlines by which they must finish. We refer to [IKin] for an overview. The line of research 
closest to our work seems to be the one initiated by Pruhs et al. [19], and later continued by 
many others; see, e.g., [31[51[7] and the references therein. Most of these works study online 
algorithms to minimize total (or weighted) fiow time plus energy. The closest result to ours 
seems to be a C'(l)-approximation algorithm for minimizing weighted flow time if the energy is 
controlled by a budget [5j. 

Our results 

We give several best possible algorithms for problem variants that involve scheduling to minimize 
the total weighted completion time on a single machine that may vary its speed. 

Our main result is an efficient PTAS (Section [3]) for scheduling to minimize ^WjCj on a 
machine of varying speed (given by an oracle). This is best possible since the problem is strongly 
NP-hard, even when the machine speed takes only two distinct values [H]. We also provide an 
FPTAS fSection l5.3p for the case that there is a constant number of time intervals with different 
uniform speeds (and the max ratio of speeds is bounded). Our results generalize recent previous 
results such as a PTAS on a machine with only increasing speeds |,21 and FPTASes for one 
non-availability period [H[|T5] . 

Our results cannot be obtained with standard scheduling techniques which heavily rely on 
rounding processing requirements or completion times. Such approaches typically fail on ma- 
chines that may change their speed since the slightest error introduced by rounding might provoke 
an unbounded increase in the solution cost. Similarly, adding any amount of idle time to the 
machine might be fatal. Our techniques completely avoid this difficulty by a change of paradigm. 
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To explain our ideas it is helpful to use a 2D-Gantt chart, see Section[21 As observed before, e.g., 
in we obtain a dual scheduling problem by looking at the y-axis in a 2D-Gantt chart and 
switching the roles of the processing times and weights. In other words, a dual solution describes 
a schedule by specifying the remaining weight of the system at the moment a job completes. 
This simple idea avoids the difficulties on the time- axis and allows to combine old with new 
techniques for scheduling on the weight-axis. 

In Section!?] we study the problem in which the machine can take an arbitrary non- negative 
speed (speed-scaling with a continuous spectrum of available speeds). For this case we show 
the rather surprising fact that the order of jobs in an optimal schedule is independent of the 
available energy. This follows by analyzing a convex program that models the optimal energy 
assignment for a given job permutation. We show that computing this universal optimal sequence 
corresponds to the problem of scheduling with a particular concave global cost function, which 
can be solved with our PTAS mentioned above, or with a PTAS for non-decreasing speed [21] . 
Interestingly, this reduction relies again on a problem transformation from time-space to weight- 
space in the 2D-Gantt chart. For a given scheduling sequence, we give an explicit formulae for 
computing the optimal energy (speed) assignment. Thus, we have a PTAS for speed-scaling and 
scheduling for a given energy budget. This, together with the fact that the optimal permutation 
is independent of the energy available, yields a (1 -I- e)-approximation of the Pareto curve which 
describes the near-optimal cost as a function of energy. 

In many applications, including most modern computer architectures, machines are only 
capable of using a given number of discrete power (speed) states. We also provide in Section [5] 
an efficient PTAS for this more complex scenario. This algorithm is again based on our techniques 
using dual schedules. Furthermore, we obtain a (1 -I- e)-approximation of the Pareto frontier for 
the energy-cost bicriteria problem. On the other hand, we show that this problem is NP-hard 
even when there are only two speed states. We complement this result by giving an FPTAS for 
a constant number of available speeds. 

It is not too hard to see that the (F)PTAS solution leads to a simple (2 + e)-approximation 
for the more general setting with release dates by applying preemptive list scheduling according 
to this order without changing the speed assignment. 

We remark that all our results for the setting with given speed translate directly to a corre- 
sponding result for the equivalent problem 1| | Wjf{Cj) (with / non-decreasing). 

2 Model, definitions, and preliminaries 
2.1 Problem definition 

We consider two types of scheduling problems. In both cases we are given a set of jobs J = 
{l,...,n} with work volume (processing time at speed 1) Vj > and weights Wj > 1. We 
seek a schedule on a single machine (permutation of jobs) that minimizes the sum of weighted 
completion times. The speed of the machine may vary — this is where the problems distinguish. 

In the problem scheduling on a machine of given varying speed we assume that the speed 
function s : R_(- — )> is given indirectly by an oracle. Given a value v, the oracle returns the 
first point in time when the machine can finish v units of work. That is, for a speed function s 
the oracle returns the value 



Thus, for a given order of jobs or for given start times, we can compute the execution time of 
each job and then the total cost of the solution (assuming that there is no idle time). 

In the problem scheduling with speed-scaling an algorithm determines not only a schedule for 
the jobs but will also decide at which speed s > the machine will run at any time. Running a 
machine at certain speed requires a certain amount of power. Power is typically modeled as a 
monomial (convex) function of speed, -P(s) = s", with a small constant a > 1. In applications, 
a is usually 2 or 3. Given an energy budget E, we ask for the optimal power (and thus speed) 
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Figure 1: 2D-Gantt chart. The z-axis shows a schedule, while the y-axis corresponds to 
W{t) = X^c >t plus the idle weight in the corresponding weight-schedule. 



distribution and corresponding schedule such that the weighted sum of completion times of 
all jobs is minimized. More generally, we are interested in quantifying the tradeoff between the 
scheduling objective X^jeJ ^j^j total energy consumption, that is, we aim for computing 

the Pareto curve for the bicriteria minimization problem. We consider two variants of speed- 
scaling: If the machine can run at an arbitrary speed level s € M.^, we say that we are in the 
continuous- speed setting. On the other hand, if that machine can only choose among a finite set 
of speeds {si, . . . , Sk}, we are in an discrete- speed environment. 

2.2 From time-space to weight-space 

For a schedule S, we let Cj{S) denote the completion time of j and we let W^{t) denote the total 
weight of jobs completed (strictly) after t. Note that by definition W^{t) is right-continuous, 
i. e., if Cj{S) = t, the weight of j does not count towards the remaining weight W'^{t). Whenever 
S is clear from the context we omit it. It is not hard to see that 



W^{t)dt. 



(1) 



Our main idea is to describe our schedule in terms of the remaining weight function W. 
That is, instead of determining Cj for each job j, we will implicitly describe the completion 
time of a job j by the value of W at the time that j completes. We call this value the starting 
weight of the job j, and denote it by 5™. Similarly, we define the completion weight of j as 
C™ :— SJ + Wj. This has a natural interpretation in the 2D-Gantt chart (see Figure [T|): A 
typical schedule determines completion times for jobs in time-space (x-axis), which is highly 
sensitive when the speed of the machine may vary. We call such a solution a time- schedule. 
Describing a scheduling solution in terms of remaining weight can be seen as scheduling in the 
weight-space (y-axis), yielding a weight- schedule. In weight-space the weights play the role of 
processing times. All notions that are usually considered in schedules apply in weight-space. For 
example, we say that a weight-schedule is feasible if there are no two jobs overlapping, and that 
the machine is idle at weight value w \i w ^ [S*™, C™] for all j. In this case we say that w is idle 
weight. 

A weight-schedule immediately defines a non-preemptive time-schedule by ordering the jobs 
on decreasing completion weights. Consider a weight-schedule S with completion weights > 
. . . > C™, and corresponding completion times Ci < . . . < C„. To simplify notation let Cq — 
C'!^j^i — 0. Then we define the cost of S as ^ I* is easy to check, even from 

the 2D-Gantt chart, that this value equals ^f^T^ where xf is the execution time of job 

j (in time-space). Moreover, the last expression is equivalent to Equation ([Ij if and only if the 
weight-schedule does not have any idle weight. In general, the cost of the weight-schedule can 
only overestimate the cost of the corresponding schedule in time space, given by ((T]). 

On a machine of varying speed, the weight-schedule has a number of technical advantages. 
For instance, while creating idle time can increase the cost arbitrarily, we can create idle weight 
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without provoking an unbounded increase in the cost. This idea yields a simple rule to delay 
one or more jobs in the time-schedule without increasing the cost. More precisely, we have the 
following crucial observation that can be easily seen in the 2D-Gantt chart. 

Observation 1. Consider a weight- schedule S with enough idle weight so that decreasing the 
completion weight of some job j yields a feasible weight- schedule. This changes the time- schedule, 
but the completion time of each job j' ^ j is not increased. Thus, this operation does not increase 
the total cost of S . 

We remark that this idea might not work if the completion weight of j is increased instead 
of decreased. 



3 A PTAS for scheduling on a machine with given speeds 

In what follows we give a PTAS for minimizing Wj Cj on a machine with a given speed 
function. In order to gain structure, we start by applying several modifications to the instance 
and optimal solution. First we round the weights of the jobs to the next integer power of 1 + £. 
This can only increase the objective function by a 1 + £ factor. 

Additionally, we discretize the weight-space in intervals that increase exponentially. That is, 
we consider intervals /„ = [(1 + e)""^, (1 for u G {1, . . . , i^} where v :— \ogi^^ Yl, 



l+e 



Wi 



We denote the length of each interval as := e(l -I- e)"^^. We will apply two important 
procedures to modify a schedule in weight-space. These techniques are useful in order to create 
idle weight by only increasing a 0{e) factor the cost. This allows us to apply Observation [TJ 
Similar techniques, applied in time-space, were used by Afrati et al. |T] for problems on constant- 
speed machines. 

1. Weight Stretch: We multiply by 1 -|- £ the completion weight of each job. This creates 
an idle weight interval of length ewj before the starting weight of job j. This operation 
increases the cost by a 1 -f £ factor. 

2. Stretch Intervals: The goal of this procedure is to create idle weight in each interval /„. 
To do so we delay the completion weight of each job j with Cj" G /„ by so that C™ 
belongs to lu+i. Moreover, the amount of weight that job j was processing in /„ equals 
the amount of weight that is processed in after modifying the schedule. This means 
that I — |-?^u| = £^(1 -I- e)"~^ = e\Iu+i\/ (1 + e) units of weight are left idle in I^+i after 
the transformation, unless there was only one job completely covering lu+i. By moving 
jobs within /„, we can assume that this idle weight is consecutive. This transformation 
increases the cost by at most a factor (1 -I- £)^ = 1-1- 0{e). Moreover, if some interval /„ 
contains the starting weight of some job, then /„ has now at least |/„ | ^ \Iu-i \ idle weight. 



3.1 Dynamic program 

Before giving further structure to near-optimal solutions, we explain our approach for obtaining 
a PTAS: we first describe a dynamic programming (DP) table with exponentially many entries 
and then discuss how to reduce its size. 

Consider a subset of jobs S C J and a partial schedule of S in the weight-space. In our 
dynamic program, S will correspond to the set of jobs at the beginning of the weight-schedule, 
i. e., if j € S and k E S \ J then Cj" < C^. A partial weight-schedule S of jobs in S implies a 
schedule in time-space with the following interpretation. Note that the makespan of the time- 
schedule is completely defined by the total work volume J^j "^j- We impose that the last job of 
the schedule, which corresponds to the first job in S, finishes at the makespan. This uniquely 
determines a value of Cj for each j G S, and thus also its execution time . The total cost of 
this partial schedule is X^jeS ■'^f^T ('^^ich has a simple interpretation in the 2D-Gantt chart). 

Consider J'u :— {S £ J : w{S) < (1 -I- e)"}- That is, a set G J-u is a potential set to be 
schedule in or before. For a given interval /„ and set S G J-u, we construct a table entry 



5 



T{u, S) with a (1 + C'(£))-approximation to the optimal cost of a weight-schedule of S subject 
to < (1 + e)" for all j e S. 

Consider now S ^ J-u and S' G J-^u-i with S" C S. Let 5 be a partial schedule of S where 
the set of jobs with completion weight in /„ is exactly S \ S'. We define APX„(S", S*) = (1 + 
e)" J2jes\S' -^f ' which is a (1 + e)-approximation to J2jes\S' ^f^T' partial cost associated 
to 5 \ S'. We remark that the values X]jGS\S' -^f ^^"^ APXi,(S", S) do not depend on the whole 
schedule S, but only on the total work volume of jobs in S' . 

We can compute T{u, S) with the following formula, 

T{u, S) = mm{T{u - 1, S") + APX„(5', S) : S' e Tu-u S' Q S}. 

This DP table contains an exponential number of entries, since J-"„ can be of exponential 
size. In the following we show that we can compute a set of polynomial size that yields a 
(1 + £)-approximation to the solution. We remark that the set will be universal, that is, the 
set is completely independent of the speed of the machine. Therefore, the same set can be used 
in the speed-scaling scenario. 

3.2 Light jobs 

We structure an instance by classifying jobs by their size in weight-space. 

Definition 2. In a given schedule, a job j is said to be light if Wj < e|/ti|, where u is such that 
5™ £ lu- A job that is not light is heavy. 

Given a weight-schedule for heavy jobs, we can greedily find a (l + 0(e))-approximate solution 
for the complete instance. To show this, consider any weight-schedule S. First, remove all light 
jobs. Then we move jobs within each interval /„, such that the idle weight inside each interval 
is consecutive. Clearly, this can only increase the cost of the solution by a 1 + e factor. After, 
we apply the following preemptive greedy algorithm to assign light jobs. 

Algorithm Smith in Weight-Space 

1. For u = 1, . . . ,1/ and each idle weight w € lu, process a job j maximizing Vj/wj among all 
available jobs with Wj < e\Iu\- 



To remove preemptions, we apply the Stretch Interval subroutine0 twice, creating an idle 
weight interval in /„ of length at least 2e|/„|/(l + e) > £\Iu\ (for e < 1). This gives enough 
space in each interval /„ to completely process the (unique) preempted light job with starting 
weight in /„. Then, Observation [T] implies that we can remove preemptions, obtaining a new 
schedule S' . We now show that the cost of S' is at most a factor of 1 + 0{e) larger than the 
cost of S. To do so we need a few definitions. 

For any weight-schedule iS, let us define the remaining volume function as 

V^{w):= ^ V,. 

j:C]">w 

For a given w, let Ij [w) be equal 1 if the weight-schedule processes j at weight w, and otherwise. 
Then, fj{w) :— {l/wj) Ij{w')dw' corresponds to the fraction of job j processed after w. With 
this we define the fractional remaining volume function, which is similar to the remaining volume 
function but treats light jobs as "liquid" : 

Vf{w):= fj{w)-vj+ Vj forallw>0. 

j:j is light j:j is heavy, Cj^ >it; 

We notice that Vf{w) < V^{w) for all u; > 0. 

^The Stretch Interval procedure also applies to preemptive settings by interpreting each piece of a job as an 
independent job. 
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Consider now the function f(v) corresponding to the earhest point in time in which the 
machine can process a work volume of v, i. e., 

f{v) inf jb > : ^ s{t)dt > w| for aU w > 0. 

Notice that this is the same function used when transforming our problem to 1| | J2j 
It is easy to see — even from the 2D-Gantt chart — that f {V'^ {'w))dw corresponds to the cost 
of the weight-schedule S. Also, notice that f{v) is non-decreasing, so that V^{w) < V'^ (w) for 
all w implies that the cost of S is at most the cost of S' . 

Lemma 3. The cost of S' is at most 1 -t- 0{s) times larger than the cost of S. 

Proof. Consider the weight-schedule Sh of heavy jobs before running Algorithm Smith in Weight- 
Space. Let 5/ be the preemptive schedule obtained after applying such algorithm. First we 
observe that for any given w is a lower-bound on (w) for any schedule S" that 

coincides with Sh on the heavy jobs. This follows by a simple exchange argument, since the 
greedy Smith-type rule in algorithm chooses the job that packs as much possible volume in the 
available weight among all light jobs. 

Observe that applying Stretch Interval twice can increase the starting weight of a job by at 
most a factor (1 -I- e)**. Also, applying this procedure to 5/ only moves pieces of jobs forwards. 
This implies that (w) < V^^ ((1 + e)^'^w) for all w. Moreover, each light job in S' completely 
finishes within an interval /„. Thus, V"^ (w) < V^\{1 We conclude that 

V^'{{l+e)^w) < Vf'{{l+e)^w) < vf^{w) < Vf{w) < V^{w) for all w>0. 
Taking the function /(•) on this sequence of inequalities and integrating implies that 

) poo 

f{V^'{{l+e)'^w))dw < f{V^{w))dw. 



Finally, the right hand side of this inequality is the cost of <S, and a simple change of variables 
implies that the left hand side is (1 -I- e)~^ times the cost of S' . The lemma follows. □ 

The next result follows directly from our previous result. 

Corollary 4. At a loss of a l+0{s) factor in the objective function, we can assume the following. 
For a given interval /„, consider any pair of jobs j,k whose weights are at most e\Iu\. If both 
jobs are processed in /„ or later and Vk/wk < Vj/wj, then CJ < C™. 



3.3 Localization 

The objective of this section is to compute, for each job j G J, two values r™ and dj so that 
job j is scheduled completely within [r^jdj) in some (1 -I- C'(£))-approximate weight-schedule. 
We call rj" and dj the release-weight and deadline-weight of job j, respectively. Crucially, we 
need that the length of the interval [r^ ^dj) is not too large, namely that dj G 0(poly(l/e)rj). 
Such values can be obtained by using Corollary H] and techniques from [T] . The release- and 
deadline- weights will help us finding a compact set 

We consider an initial value for rj" and then increase its value iteratively. We will restrict 
ourselves to values of rj that are integer powers of 1 +e. Consider an arbitrary weight-schedule. 
Applying the Weight Stretch subroutine, we obtain that the starting weight of each job satisfies 
SJ' > swj. Thus, we can safely define rj as ewj rounded down to an integer power of 1 e. We 
can now use the approach taken in [1]. Let J„ be the set of all jobs with rj equal to (1 + e)"^^. 
We partition J„ into light and heavy jobs, and then classify heavy jobs by its weight. Note 
that a heavy job in can have weights w with e\Iu\ < w < 1/£(1 -|- e)"~^, where the last 
inequality follows since r™ > ewj. Therefore for a fix m we only need to consider heavy jobs 
with a weight w e ilu ■= {(1 + e)* : £|/«| < (1 + e)' < l/e(l + e)""^}- Crucially, note that 
l^ul G ©(log]^,!.^ 1/e) C 0(l/e • logl/ e). Based on this we give the following decomposition of 
the set of jobs with a given released weight. 
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Definition 5. Given release weights for each job, we define Ju — {j '■ rj — {1 + e)""^}. 
Additionally, we decompose Ju into a set of light jobs := {j G Ju ■ Wj < e|/u|}, o,n,d a set 
Hu,w = {j G Ju ■ Wj = w} of heavy jobs of weight w for each w £ Vlu- 

Let us now consider all jobs in L„. If w{Lu) is more than then some of these jobs will 
have to start in lu+i- By Corollary U we can choose the set of possible jobs with starting weight 
in /„ greedily, and increase the release weight of the rest. Similarly, since the weight of each job 
in Hu^w is the same, we can always give priority to jobs with the largest processing time. With 
this idea we have the following lemma. 

Lemma 6. We can compute in polynomial time release-weights rj' for each job j such that 
there exists a (1 + 0{e))- approximate weight- schedule satisfying the release weights and for all 
interval lu, w{Ju) G 0{l/e^ ■ logl/e • \Iu\)- 

Proof. Initialize as ewj rounded down to an integer power of (1 + e). Let J„ be the set of 
jobs with release weights (1 + e)"^^. Decompose set J„ in L„ and sets Hu,w for all w £ r2„ as 
above. We consider jobs in L„ and each set Hu,w separately. 

By Corollary SI we know that within an interval we can order light jobs and processed first 
the job with largest Vj/wj ratio. Thus, if the total weight of jobs in Lu is larger than (1 + e)\Iu\ 
we can simply increase the release weight of the jobs with the largest Wj/vj to (1 + e)". We do 
this until w{Lu) < (1 + £)|-fu|- 

We do a similar technique for jobs in Hu.w If w{Hu,w) > \Iu\ + w and l-ffu^ml contains more 
than one job, then we can delay the release weight of the jobs with smallest Vj. This follows by 
a simple interchange argument, since if there are two jobs with the same weight then the one 
with smallest work has smaller (larger) completion time (weight) . 

With this we obtain a set Hu,w with 

w{Hu,u.) <\Iu\+w< \Iu\ + -{! + ef-^ e 0(l/e2) . 

£ 

We can iterate the two procedures described above until the following property holds: for all u 
and w e il„ we have that w{Lu) < (1 +e)|/u| and w{Hu.iu) G 0(l/e^) • |/ti|. The result follows 
since e ©(l/e • log 1/e). □ 

We use the previous lemma to define the deadline-weights by using the following idea. For 
s large enough (but constant). Stretch Intervals creates enough idle weight in lu+s to fit all 
jobs release at (1 + e)" that have not yet finished by (1 + e)""'"*+-'^. This allows us to apply 
Observation [TJ 

Lemma 7. We can compute in poly-time values and dj for each j € J such that: (i) there 
exists a (1 + 0(e)) -approximate weight-schedule that processes each job j within [rj',dj), (ii) 
there exists a constants G 0(log(l/e)/£) such that dj < r™-(l-|-£)*, (Hi) rj and d^ are integer 
powers of (1 + e), and (iv) the values an dj are independent of the speed of the machine. 

Proof. Consider the release weights given by the previous lemma and consider the associated 
sets Ju for each u. The construction of the release weights guarantees the existence of a value 
s G 0(logi+Jl/£4 .logl/e)) C 0(log(l/e)/£) such that u;(J„) < £|/„+,_i |/(1 + e). 

Now we apply Stretch Intervals to the schedule. This creates £|/„+s_i|/(l +£) idle weight in 
interval I^+s-i, unless there was one job completely covering lu+s-i- If that is not the case, then 
we can move all jobs in Ju with starting weight in lu+s or larger to be completely processed inside 
lu+s-i- By Observation [U doing this can only increase the objective function by a 1 + 0{e) 
factor. Similarly, if there was a job k completely covering lu+s-i, then the idle weight that 
lu+s-i should have contained can be considered to be just before the starting weight of k. In 
this case we can move all jobs in J„ that were being processed after lu+s-i to just before S*™. 

In either case we constructed a solution where each job in J„ is completely processed in 
[(1 + £)"-!, (1 + £)"+^-i). We conclude by defining dj = {1 + e)"+' for each job j G J„. 
Note that (iv) also follows since rj and dj defined as before do not depend on the machine's 
speed. □ 
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3.4 Compact Search Space 

Recall the definition of J\, given at the beginning of Section 13.11 and that we need to define 
a polynomial size version of J-u- We call this set Fu- Instead of describing a set S* S J-^u, we 
describe V — J\S, that is, the jobs with completion weights in lu+i or later. That is, we define 
a set Vu that will contain the complements of sets in J^ti. In order to define we use the 
release- and deadline- weights given by Lemma [T] If y G then V must contain all jobs with 
release weight in lu+i or after. Let V :— {j ^ J : rj > {1 + e)"}. 

Observation 8. Each set V G 2?„ is of the form V' U V , where every job j £ V' has rj < 

Thus we only need to describe all possibilities for V' . For a job j G V we can assume 
that dj > (1 + Therefore, by Lemma [3 we have that rf > {1 + where s G 

0(log(l/£)/e). 

Observation 9. Each set V e Vu is of the form [J V^,\uV, where := {j G V : 
rf = {l+eY}. 

Then, we aim to find a collection of subsets that can play the role of V^. If the size of this 
collection is at most a polynomial number k, we could conclude that < fc* = k^(^°si^/'^y'^\ 

In order to do so, recall that J„ denote all jobs with release- weights equal to (1 -I- e)", and 
that we can write Jy = ^«U(1J^ H^^y^,) where w ranges within C'(log]^_|_g 1/e) many values. Thus, 
defining V^ ^^ := fl H^^i^ we can further decompose as {V^ fl Ly) U (IJ^ Vv,w)- ^ow notice 
that ^ is a subset of Hy^^ which, as justified in the next observation, contains constantly 
many sets. 

Observation 10. Without loss of generality, we can restrict ourselves to consider sets ^ 
among 0{l/e) distinct options. 

Proof. Each job in Hy -uj for w G il^, and the total weight of Hy^y, is at most \Iy \ + w, thus Hy^y, 
contains at most 1 + \Iy\/w many jobs. Since by definition of Hy^yj we have that w > e\Iy\, we 
obtain that |i?t,,^| G 0{l/e). Moreover, all jobs in Hy^yj has the same weight w and the same 
release- weight. Therefore, we know that this jobs are order by their work volume in an optimal 
solution. Thus, we can restrict ourselves to sets ^, that respect this order. The observation 
follows since there are at most |-fft,.u)| G 0{l/e) many sets that respect this order. □ 

Given v, the index w ranges over \fly\ G C'(log(l/e)/e) many values. Thus the following 
holds. 

Observation 11. For each v the set {j^V^y^ can be chosen owr (l/e)'^('°s(i/'^)/'^' ^ 2°(i°s(i/'^)'/^) 
many alternatives. 

We use a similar argument for Vy fl Ly. Indeed, w{Ly) < (1 4- £)\Iy\ and jobs in L„ will be 
processed as light jobs (by Lemma [T]). We now show that we can group light jobs together in 
order to diminish the possibilities for Ly. This is done as follows. Let s be the number given by 
Lemma [71 We order jobs in Ly by inverse Smith's rule order. Then we greedily find groups of 
jobs in Ly by going in the list of jobs from left to right such that each group has total weight 
in [e|J^j/s, 2£|/t,|/s]. Recalling that w{Ly) G 0{\ly\), we obtain that this procedures creates at 
most 0{s/e) groups. Let Ly^i be the z-th groups. 

Lemma 12. There exists a (1 -f 0{e))- approximate weight schedule such that: (i) it satisfies 
the release- and deadline-weights of Lemma ^ (ii) in each group Ly^i all jobs are processed 
consecutively for each v,i, and (Hi) within each set Ly jobs are processed following reverse 
Smith's rule order. 
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Proof. Consider the schedule given by Lemma [71 First notice that this schedule must not 
necessarily satisfy the property of Corollary SI since in the proof of Lemma [7] we changed the 
order of jobs. However, in that proof for each v we moved jobs in together. Therefore, 
within each Jy the reverse Smith's rule order is preserved. Let us fix an interval . Within 
this interval, the schedule only processes jobs in J„ with v S {v' — s, . . . , v'}. Let us fix such a 
set Jy. Since we follow reverse Smith's rule within Jy, there is at most two sets Ly^i that are 
partially processed in lyi. They require 4£|/^|/s extra space within /„/ in order to be processed 
completely within /„/ . Summing over v 6 {v' — s, . . . , w' — 1}, we obtain that in total we require 

4e y M <4es^g0(£|/„,|) 
^-^ s s 

v—v'~s 

extra space in lyi . The result follows since we can create enough idle time within lyi by applying 
0(1) times the procedure Stretch Intervals. We remark that the procedure described works 
simultaneously for all intervals ly. □ 

With this lemma, we can find a compact description to VyHLy. Indeed, to specify Vy Ci Ly, 
i.e., the jobs in Ly that are processed in lu+i or later, we just need to determine the index i 
such that jobs in Lyj with j > i are Vy and jobs in Ly^j with j < i are not in Vy. Since i ranges 
over 0{s/e) € 0(log(l/£)/e^) many options, we obtain the following. 

Observation 13. The set Vy f] Ly can be chosen over 0(log(l/£)/e^) different options. 

Combining this last observations and Observation [TI] we obtain that Vy can take at most 
k < 2'-'('°s (1/'^)/'^) many different options. By Observation [SI we conclude that V' belongs to a 
set of size at most fc* < 

20(iog^(i/e)/e")^ With this and Observation [H we can define Vu having 
size at most 2^(i°s (i/^)/^ ). With our discussion we conclude the following lemma. 

Lemma 14. For each interval ly, we can construct in poly-time a set Ty that satisfies the 
following: (i) there exists a (1 + 0{e))- approximate weight- schedule in which the set of jobs 
with completion weight at most (1 + e)" belongs to Ty for each interval lu, (ii) the set J-y has 
cardinality at most 2'^('°s (i/^)/'^ )^ and (Hi) the set J-y is completely independent of the speed of 
the machine. 

With this lemma and the discussion at the beginning of this section we obtain a PTAS, 
which is best possible from an approximation point of view, since the problem is known to be 
strongly NP-hard [12]. 

Theorem 15. There exists an efficient PTAS for minimizing the weighted sum of completion 
times on a machine with given varying speed. 

Proof. We just need to argue about the running time. It is easy to see that the time for creating 
sets J-y is dominated by the time needed to solve the dynamic program. Moreover, the number 
of entries of the table is 2'^('°s (i/'^)/'^ ) . log(^^- Wj), and the time needed to fill each entry is 
20(iog^(i/e)/e^) Therefore the running time is 20(i°s'(i/e)/e^) . iog(^^. ^^.) . 20(iog'(i/s)/^') . „ = 
20(log^(l/c)/e^) .log(^^.^^.) .J, □ 

4 Speed-scaling for continuous speeds 

We now consider the problem of minimizing "^jWjCj for a given amount of energy available 
E. In this section we consider a continuous spectrum of speeds. It is easy to see that in this 
setting each job will be executed at a uniform speed because of the convexity of the power 
function |23| . Let Sj be the speed at which job j is running. Then j's power consumption 

is pj — sf, and its execution time is Xj — Vj/sj — Vj/p^J"' . The energy that is required for 
processing j is Ej ~ pj ■ xj ~ pj ■ ^ — s"^^ • vj — v'^ /x^~^. 
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The problem of computing the optimal energy (or speed) assignment for all jobs in a given 
job sequence tt using a total amount of energy E can be expressed as a convex program: 
Let jobs be indexed according to some given order n. We rewrite the objective function as 
^"^iWjCj = w'i ELi 2;fe = E"=i2;jELjWfc and define Wj = ELi^"^- Note that 

Xj = i^v" / Ej)^^^" ^\ Then the problem can be formulated as 

min Y.W,-[^] (2) 

n 

Y,E,<E (3) 

Ej>0, j€{l,...,n} (4) 

This program has linear constraints and a convex objective function (which follows from 
the convexity of the function 1/x). Such programs can be solved in polynomial time up to an 
arbitrary precision [18] with the Ellipsoid method. However, with a better understanding of its 
structure and by applying the well-known KKT conditions, we easily derive an explicit formulae 
for computing an optimal energy assignment. 

The problem ©-(HI) has a feasible solution since Ej — 0, for j — l,...,n satisfies both 
constraints, ([3]) and However, an optimal solution satisfies Constraint ^ with equality, 
since we allow arbitrary speeds and thus arbitrary energy assignments, and the smallest increase 
in the assigned energy will decrease the total cost. For the same reason and with a positive 
energy budget, an optimal solution will never assign zero energy to any job; hence, none of the 
Inequalities ^ is satisfied with equality. 

Now, we apply the well-known Karush-Kuhn- Tucker (KKT) conditions which give necessary 
and sufficient conditions on an optimal solution to convex programs with linear constraints [6]. 
With the observations above they reduce to the following conditions. 

Lemma 16 (KKT conditions). A vector (Ei,...,En) is an optimal solution to the convex 
program ^ -([4]) if and only if 

(a) {El, . . . , En) is feasible and satisfies ([3]) with equality but none of ([4]) with equality, and 

(b) there exists a parameter A > such that \Ig{Ei, . . . , En) + A • = 0. 

Here, ])f denotes a vector with ones in each coordinate. 

These conditions lead to an explicit description of the optimal solution of the convex pro- 
gram ©-(HI). 

Theorem 17. For a given job sequence it, a power function P{s) = s" and an energy budget E, 
the optimal energy assignment in an optimal schedule for minimizing WjCj subject to C7r(i) < 
. . . < Cjr^n) is determined by 

n 

E, ^ V, ■ W^-'^'- ■ f , where 7. = ^ ^^i^) ' 

Proof. To simplify notation let us reorder the jobs so that Ci < C2 < . . . < C„ and 7r(j) = j for 
all j (as above) . Then we need to show that the optimal energy assignment is given by 

E, = vj ■ Wf-^^l'^ ■ where 7 = ^ • M^j""'^/". 

Let {El, . . . ,En) be an optimal solution to the convex program ©-Q. By Lemma ITCT b). 
there is a A > such that for every job j G {1, . . . ,n} holds 

w, ■ . _L . + A = , 
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which is equivalent to 

/ -, \ (tt-l)/a 



(a-l)A 



To determine the lagrangean multipher A (or directly replace the last product term in ([5])), we 
use the fact that Inequality ([3]) is satisfied with equality, that is, 



Then, we can express the values Ej in ([5]) independently of A and conclude with 



□ 



Interestingly, the optimal job sequence is independent of the energy distribution, and even 
stronger, it is independent of the overall energy budget. In other words, one scheduling sequence 
is universally optimal for all energy budgets. Furthermore, this sequence is obtained by solv- 
ing in weight-space a (standard) scheduling problem with a cost function that depends on the 
power function. 

Theorem 18. Let a > 1 be a constant. Given a power function P{s) — s" , there is a universal 
sequence that minimizes "WjCj for any energy budget. The sequence is given by reversing an 

optimal solution of the scheduling problem 1| I'^WjC^"' 
Proof. Assuming that jobs are indexed following a sequence tt. 



where the last equation comes from the definition of 7^ (see Thm. [T7| and standard transforma- 
tions. This equation implies that the optimal job sequence is independent of the available energy 
budget since E only plays a role in the factor outside the sum, which is independent of the per- 
mutation. Since the exponent is constant, the problem of finding the optimal sequence under an 
optimal energy-distribution reduces to finding the sequence that minimizes "^j ' . 

Now recall the reinterpretation that the 2D-Gantt chart view offers (see Sect. [2]). Then this 
problem is equivalent to the scheduling problem in the weight-space with varying speed on the 
weight-axis or general cost function in the weight-space. This problem can be directly trans- 
lated into minimizing the total weighted completion time on a machine with varying speed 
(or the desired form with a generalized cost function) by re-interpreting weight-space as time- 
space (Section [2]). We simply define a new problem in time-space with processing times v'j — wj 

and weight w'j = Vj with the objective function of minimizing ^'j ' f {j2k=i ■ This is a 

problem of the desired type. By Section [5] it is easy to see, that a solution tt' to the new problem 
in time-space, has a corresponding solution tt in the weight-space with same total cost; tt is the 
reverse of tt'. □ 

Thus, the scheduling part of the speed-scaling scheduling problem reduces to a problem which 
can be solved by our PTAS from Sect.|21 Also, since the cost function f{x) = a:("~i)/" is concave 
for a > 1, the specialized PTAS in [21] also solves it. Combining Theorems [TTl and ITSl gives the 
main result. 

Theorem 19. There is a PTAS for the continuous speed-scaling and scheduling problem with 
a given energy budget E. Indexing jobs in the order of the solution given by the PTAS, the 
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(I + e)- approximate Pareto curve describing the approximate scheduling cost as a function of the 
available energy is given by the right-hand- side of Equation i.e., 



Proof. Define a new input instance with v'j = Wj and w'j = Vj and run a PTAS for the scheduling 
problem 1| | '^WjfiCj) with f : x ^ (Theorem [15] or 21 ). Let tt denote the reverse of 

the solution sequence. Theorem [18] guarantees that for a given optimal energy allocation, tt has 
scheduling cost at most a factor (1 + e) larger than the optimal sequence. This optimal energy 
allocation can be computed with the formulae in Theorem [T7] The optimal scheduling cost as a 
function of the energy budget E are described by Equation ([5]). Since tt is a (1 + £)-approximate 
solution this concludes the proof. □ 




5 Speed-scaling for discrete speeds 

In this section we consider a more realistic setting, where the machine can choose one out of k 
different speeds available si > . . . > > 1. For this problem we resolve the complexity status 
and show that it is NP-hard even when k — 2. We give an FPTAS for this problem when the 
number of available speeds and their maximum ratio are bounded by a constant. For the general 
problem with arbitrarily many speed states we give PTAS even for arbitrary power functions. 

5.1 A PTAS for discrete speeds 

Let the power function P{s) be an arbitrary computable function. To derive our algorithm, we 
adopt the PTAS for scheduling on a machine with given varying speed (Sect. [3]) and incorporate 
the allocation of energy. 

We adopt the same definitions of weight intervals lu and sets as in Sect. [3] We discretize 
the weight-space in intervals /„ = [(1 + e)"~^, (1 + e)") for w G Z and denote the length of the 
interval as \Iu\ ■— e(l + e)""^. For a subset of jobs S* £ J^u and a value ^ > 0, let E[u,S,z] 
be the minimum total energy necessary for scheduling S such that all completion weights are in 
interval or before, i.e., C™ < (1 + e)" for each j G S, and the scheduling cost is at most z, 
i-^-i X^jGS'^i ' — where Xj is the execution time under some feasible speed assignment. 
Recall that the speed assignment determines the energy. The recursive definition of a state is as 
follows: 

E{u, S, z) = mm{E{u - 1, 5', z') + APX„(S' \S',z~ z') : S' £ J"„_i, S" C S}. 

Here APX„(S' \S' ,z — z') is the minimum energy necessary for scheduling all jobs j ^ S\S' 
with Cj" G lu, such that their partial (rounded) cost J2jes\S' + most z — z' . This 

optimization problem can be solved by a linear program when aiming for a (l + e)-approximation 
of the optimal scheduling cost. In that case, we can round all completion weights up to powers of 
1 + £. Thus, jobs in S' C S, S' G J-u-i have the same completion weight (1 + e)'^. Then we can 
formulate the subproblem of computing APX„(S'\S", z — z') as a linear program. Let the solution 
variable £i > 0, i £ {!,...,«}, denote the length of the time interval in which the machine is 
running at speed Si. These lengths must guarantee that the total processing volume w(S" \ S) 
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can be completed (Eq. ([5])) and that the total scheduling cost does not exceed z — z' (Eq. ([5])). 



min^^, •P(s,) (7) 

i=l 

K 

^k-{\+eT <z- z' (9) 

I, > (10) 

We let the DP fill the table for u e {0, . . . , i^} with i' = [logX^jg/ ^^id z e [1, zub] for 

some upper bound such as zub = Sjej ""^i Sa:=i '^jl ^k- Then among all end states [z/, J, ■] with 
value at most the energy budget E we choose the one with minimum cost z. Then we obtain 
the corresponding (1 + e)-approximate solution for energy E by backtracking. 

This DP has an exponential number of entries. However, we can apply results from Section[2] 
and standard rounding techniques to reduce the running time. 

Theorem 20. There is an efficient PTAS for minimizing the total scheduling cost for speed- 
scaling with a given energy budget. 

Proof The DP computes a (1 + e)-approximation in exponential time. In Lemma [Ml we showed 
how to reduce the exponential number of subsets in J-u to a polynomial number at the cost of 
a factor 1 + 0{e) in the total scheduling cost. Recall that the sets given by that lemma are 
independent of the speed of the machine. Therefore we can use these sets directly in our setting. 

It remains to reduce the number of possible values of cost z G [0, zub]- At the cost of a 
factor 1 + e, we may round up in each state the scheduling cost to next integer power of 1 + (5 
with (5 = (1 + e)^/'^ — 1. In each state transition of the DP, we loose up to a factor 1 + 5 in the 
scheduling cost, which amounts to at most a factor (1 + SY = 1 + e under v state transitions. 
When restricting to powers of 1 + 5 then the number of different values in z S [0, zub] is bounded 
by 0{\ogi^g Zub) = 0{iy ■ logzuB/e)- Thus, the number of states in the table is polynomial. We 
conclude that the algorithm runs in polynomial time. □ 



5.2 Speed-scaling for discrete speeds is NP-hard 

We show that speed-scaling for discrete speeds is NP-hard. We provide a reduction based 
on the problem of minimizing the total weighted tardiness of jobs with a common due date, 
l\dj — d\J2 "WjTj, which is known to be NP-hard [53]. Here, Tj = max{Cj — d, 0} denotes the 
tardiness of job j. We use the following generalization of this result for our reduction. 

Lemma 21. The problem of of minimizing '^Wjf{Cj) on a single machine of unit speed is 
NP-hard even when f is non- decreasing, piecewise linear and convex with only one breakpoint. 

Proof. Let e > and define the cost function 




s ■ X if < X < d, 

Cj — d + ed if d < x. 



Note that Tj — fo{Cj). Now we show that, for e > small enough, minimizing '^jWjTj is 
equivalent to minimizing J^j Wjfe{Cj). 

Let fc g N, and assume that Wj,pj and d are natural numbers for all j. It is known that the 
problem of deciding whether there exists a schedule with 'Y^jWjTj < fc is NP-hard [24j. Now 



14 



notice that 



3 3-C'j<d 3-Cj>d 



Wi 



\3:Cj<d j.Cj>d J J 

Defining s — l/{d J2j wj) < 1 (which can be described with polynomiaUy many bits) we obtain 
that 




= e ■ \ > Wid + > dwi \ < edy Wi < 1 



3 3 

Therefore Wjfo{Cj) < fc if and only if J^j '^jfeiCj) < k + 1. We conchide that minimizing 
J2j Wjfe{Cj) is NP-hard, where £ < 1 is considered as part of the input. □ 

Now we are ready to give the main result. 

Theorem 22. The problem of minimizing WjCj on a single machine for discrete speeds is 
NP-hard, even if the number of available power levels is 2. 

Proof. The problem with fc > 2 speed states can be reduced to the case with 2 speed states, by 
adding dummy states of arbitrarily slow speed. Therefore, we prove hardness of the case of two 
speeds si > 82- 

Consider a scheduling instance on a unit-speed processor with the objective of minimizing 
y^^- Wjfe{Cj). We define an equivalent scheduling instance for minimizing WjCj on a machine 
with two possible speed states. In the new instance, the job set is the same and the values Wj 
and Vj for each job j are also preserved. Let si = l/£ and S2 ~ I- The total energy budget 
is i? = V + d{l/e"~^ — 1), where V denotes the total work volume, J^j "^j- simple interchange 
argument shows that in an optimal solution the machine runs at decreasing speeds. The time 
point when the speeds changes is uniquely defined by the energy budget and the total work 
volume. In this case, the machine runs at speed si until t — ed and then it runs at speed S2- It 
is easy to verify that the total work volume finishes by r is t • si = d. 

Consider now a schedule without idle time on a machine with the speed profile just described. 
Assume that by relabeling the jobs the completion times satisfy that Ci < C2 < . . . < C„. 
Consider scheduling the jobs in a unit speed machine using the same permutation of jobs. 
In this new schedule the completion times are = TlkKj'^k ^'-'^ ^- easy to check 

that fe{Cj) — Cj. We conclude that the problem of minimizing ^jWjJ^{C'j) is equivalent to 
minimizing WjCj on a machine that has speed l/£ in interval [0,£(i] and speed 1 afterwards 
until all jobs are done. By Theorem[2T]both problems are NP-hard, wich concludes the proof. □ 



5.3 An FPTAS for a constant number of speed states 

Let si>...>Sk>1-A simple interchange argument shows that an optimal solution chooses 
the speed non-increasing over time. We construct a schedule in weight-space. There are at 
most K jobs that run at more than one speed; call them split job. 

We may guess the split jobs and they completion weight at an affordable loss in the total 
cost. 

Lemma 23. At the cost of an increase in the scheduling cost by a factor 1+C'(£), we may assume 
that the completion weights of split jobs are integer powers of 1 + /3 for /3 = (1 + — 1. 

Proof. This follows directly from applying the technique of stretching intervals for each split job; 
see Sect. [31 Each time, this increases the total cost by a factor 1 + f3, which amounts to at most 
a factor (1 -I- /3)''~^ = 1 + £ for at most k — 1 split jobs. □ 
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A fixed choice of split jobs and their completion weights partitions the weight-space into k 
subintervals Ii, . . . , with length starting at > X^^Z^ \h\- To obtain a schedule, we have 
to fill the remaining jobs non-preemptively in these subintervals (keeping the split jobs where 
they are). By construction all jobs in one subinterval will run at the same speed, i.e., jobs in 
weight- interval /j will run at uniform speed where s- = Sn-i+i- (Recall that the job sequence 
in weight space corresponds to the reverse sequence in time space.) We want to find a schedule 
of minimum total cost and we have to control the energy consumption. 

5.3.1 Dynamic program 

We construct a DP that finds a partition of a set of remaining jobs into k subsets each of which 
is assigned to an individiial interval 7^. The jobs in each individual set are scheduled according 
to Smith Rule in weight space, that is, in non-decreasing order of ratios wj/vj. Let all jobs be 
indexed in this order. 

The dynamic program generates a state [k, z,yi, . . . , Uk-i] if there is a feasible schedule of 
jobs 1, . . . , fc, in which the total weight scheduled in interval li (excluding the split job) is yi, 
and where the total value (including split jobs) is z := xjCj' , where xj = vj/s'^ for a job j 

in interval i. The value of the state [k, z,yi,. . ., yK-i] is the minimum energy that is necessary 
for obtaining such a schedule. The dynamic program starts with the states [0, z, 0, . . . , 0]. For 
each z-value a linear program computes the minimum energy that is necessary to obtain this 
scheduling value when scheduling only the split jobs Jg . It determines the power that is assigned 
to each split job and thus their actual execution times. Let £ji be the amount of time that split 
job j is running at a valid speed s'^. 

K-l 

jeJ, i=i 

jeJs »=i 

iji>0, j eJs,ie{i,-.-,K} 

^ji = Oi j & Js, Si not valid . 

After computing the starting states, the DP computes all states by moving from any state 
[j — 1, z,yi, . . . , yK-i] to at most k new states [j, z',y[,. . . , y^-i]- We distinguish the k different 
possibilities of assigning job j: In case that job j is assigned to intervals 7, for some i e {1, . . . , k— 
1} then 

z' = z+^ ■ {u}i + yi + Wj) and y^ = y^ + Wj and y-, = yi' for i' ^ i , (11) 

provided that y- < \Ii \ — vj^, where ji is the ith split job. In the case that j is scheduled in 7„, 
then 

z' = z + ^ ■ iuj^ + ^wt-^yi'\ andy'i = yi,iG{l,...,K-l}. (12) 
''^i \ e=i e'=i J 

In any case, the value of the new state is 

E\j, z,yi,..., y^-i] = E[j -l,z,yi,..., y^-i] + ■ P{s'i). 

An optimal schedule can be obtained by finding a state S[n, 2;,yi, ... ,2/^-1] < E with 
minimum z and backtracking from that state. Since the z- values are bounded by Zjjb '■= 
^j^S'e^i '^(■1 Sk) and the y^-values are bounded by |7j|, the running time of this dynamic 
programming algorithm is 0{n ■ Zub ■ maxj 
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5.3.2 FPTAS 



In a fully polynomial time algorithm, we can neither afford to consider all possible objective 
values z, nor can we consider all possible j/i-values. As in the PTAS in Theorem [20l we can 
reduce the number of possible values of scheduling cost z G [0,zub] to 0{y ■ logzue/s) by 
restricting to powers of 1 + /3' with /?' = (1 + e)^/" — 1 and losing a factor 1 + e in the scheduling 
cost. 

The main challenge is to discretize the range of values in an appropriate way. Notice 
that we cannot afford to round y-values since they contain critical information on how much 
processing capacity remains in an interval. Perturbing this information causes a considerable 
change in the set of feasible schedules. In this way one might lose optimal schedules or introduce 
infeasible schedules with too much processing in an available interval. Both effects cannot be 
controlled easily, and thus, must be avoided. 

The intuition behind the following algorithm is to reduce the number of states by removing 
those with the same (rounded) objective value and nearly the same total work in available 
interval 7^. Among them, we want to store those with smallest amount of work in an interval 
li in order to make sure that enough space remains for further jobs that need to be scheduled 
there. 

Algorithm F: 

1. For an arbitrary given e > let (5 := ejn. 

2. Partition each interval li into sub-intervals ^ of length for fc = 1, . . . , [n/e] ; the last 
interval may be smaller, respectively. We say a value yi is in interval if '^i + j/i G Ii,k- 

3. Run the dynamic program DP with the following modification. Among the states for the 
same job set and the same (rounded) objective value z, we store at most one for each 
interval Ii,k, namely the one with currently minimum y- value within J^ fe. 



Lemma 24. Suppose that algorithm DP on an instance with n jobs finds a chain of stated 



[0, 0, 0, . . . , 0], [1, zl, yli, . . . , y%_i^i], . . . , [n, <, yt,„, • . . , yl-i,n]- Then Algorithm F finds for each 
j ^l,...,n a state [j, Zj,yij, J/k-1j] with 



Proof. We give a proof by induction on the number of jobs. For one job we store at most two 
states which obviously fulfill both conditions in (1131) . Suppose that the lemma is true for j jobs. 
Now consider state + ^ij+i, . . . , y*_ij+i] that was obtained from [j, z*, yl^ . . . , 2/*_ij] 

according to (|lip or We distinguish the two cases. 

First case: If state 1, ■ • ■ , Vk-i j+i] obtained from a state [j, z* ,yl . . . , , y*„i j] 
by adding job j + 1 to interval li, for some i £ {1, 1}, then Algorithm F when doing 
the same starting from [j, Zj,yij, . . . , y^-Lj] yields zj+i — Zj + Vj+i/ s[ ■ (w^ + yj + ifj+i) and 
yij+i = yij + Wj+i while it keeps j/i'j+i — yi\j for all z' ^ i. However, we cannot guarantee 
that this state will survive, because we might find a partial solution with the same objective 
value Zj+i but smaller values yi'^k within the same subinterval Ii',k, for i' £ {1, . . . , k — 1}. But 
in this case j/i' j+i — J/i' j+i is bounded from above by the length of interval li'^k and thus by 

\S. Thus, it holds for aU i' ^ i: 



ylj+i > yij+i > yi,j + Wj+i - \Ii\S > ylj - j\Ii\S + Wj+i - \Ii\S = - (j + l)\Ii\S. 




h'\Ss^>y; 



(J + 1)I^.'I^, 



and for the z-th y-value we get 



^ Chain of states means that, for j = 0, . . . ,n — 1, state [j + 1, z* 
[j^ tVi j t ■ ■ ■ ^y%—l j] tiy adding job j + I according to llllll or II12II . 



-j+i' Hi.j+i' ■ • • ' fK-l,j+l 



] is obtained from 
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Moreover, 

zj+i < 2j + ^ • + y,,j + w,+i) < (1 + (j - 1)5) z* + ^ . + yl^ + Wj+i) 
< {l+jS)z*^,. 

Second case: If state z*^^! J/i j+ij • • • 7 Uk-ij+i] obtained from state [j, z*,ylj, . . . , y*_]^ 
by adding j + 1 to the last available interval then Vij+i = yl^j for alH G {1, . . . , k — 1} and 
Algorithm F yields yi,j+i = yij when doing the same. If some values yi,j+i are later replaced 
by some smaller values yij+i in the same interval 7^^^, we still get 

yl-j+i > Uu+i > Vt^j - \h\5 > y*j+i - (j + i)\i,\s. 

Moreover, 

, "j+i ( 
Zj+i = Zj + ^ • I 

< (1 + (j - 1)5) z*^"-2±^.\u^^ + Y.n„-Y, (y;, ^. - j |/r |<5) 




< ( 1 + - 1 ) <5 ) z; + ( 1 + J <5 ) . ^ J + ^ - ^ y; 

'^'^ V (.= ! t = l J 

< {l+jS)z*^,. 

The inequalities are due to induction hypothesis, l-^^'l — '^k' ^^"^ (1121) . □ 

Now we can prove the main result. 

Theorem 25. There is an FPTAS for speed-scaling with a given energy budget for min'^WjCj 
on a single machine with constantly many discrete 



Proof. Let OPT denote the value of an optimal solution. Lemma guarantees that Algorithm 
F finds an end state that corresponds to a solution for all jobs of value z„ < {1 + {n — l)S)z*^ < 
(1 +e)2;*, where z* is an optimal solution when restricting to objective values that are multiples 
of sZi^B/n. Since the rounding error for at most n jobs accumulates to at most eZ^b, we have 
that z;; < (1 + £)OPT and thus z„ < (1 + e)20PT. 

It remains to show that the running time is polynomial in the input and 1/e. When restricting 
a split job's completion weight to powers of 1 + /3 for /3 = (1 + — 1 then the number 

of different completion weights for one split job is O{logl_^_p J2je.j)^j ^ ^i^^ ' '^/^)- Thus, the 
algorithm considers 0{{k ■ i'/e)'^~^) many possibilities for split jobs and their completion weight. 
For each situation, we have 0{i' ■ logzue/e) = 0{i^ ■ nje^) possible objective values. Concerning 
the 2/- values, we consider Xjb subintervals for each interval Jj for i e {1,...,k — 1}. Since 5 — e/n 
this amounts to {n/e)'^~^ different value assignments for k — 1 intervals. Thus, the algorithm 
has running time 0{n'^ v'^ n'^^^ / e^^) . □ 



5.4 FPTAS for a constant number of different-speed intervals 

In this section we briefly discuss a different application of the technique presented in Section [5.31 
Consider for the problem of scheduling jobs on a single machine with a constant number of time 
intervals each of uniform speed. Applying the ideas from Section 15.31 in time-space yields an 
FPTAS. More precisely, we obtain the following result. 

Theorem 26. There exists an FPTAS for non-preemptive"^ scheduling to minimize ^WjCj on 
a single machine with a constant number of intervals of different, but uniform speed. For the 
resumable^ setting, there is an FPTAS in the same setting when the maximum ratio of speeds is 
bounded. 
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To obtain this result we again guess the spht jobs, the availabihty intervals that they cross 
(in time-space), and their completion time. 

Lemma 27. We may assume that the completion times of split jobs are powers of I + s/s by 
loosing at most a factor 1 + e in the total scheduling cost. 

Proof. Consider an optimal solution with completion times for (at most) k — 1 split jobs 
C*, . . . , C*_]^. Now, for z = 1, . . . , K — 1 do the following: round C* up to the nearest power of 
Ci := (1 + e/s)'^, k € N, and shift the entire job to match this new completion time. We also 
adopt the schedule of the subsequent jobs in such a way that there is no idle time created, that 
is, no job is delayed but at least one job previously completing after i is processed now at least 
to some extent before i. Then no job but i is delayed and possibly executed at some lower speed. 
The amount of delay is bounded by 

Sk S • Sk \ sJ ' 

Thus, given an optimal schedule and rounding for each job i the value C* as described may lead 
to an increase of the completion time Ci by at most e • C* , that is, 

Cj ^Cj+e- C* < (1 + e)Cj . 

□ 

With the given speeds in each interval li we can compute also the starting time for each 
split job. This gives us a partition into time intervals in which we have to fill the remaining jobs 
non-preemptively. To do so, we apply the FPTAS from the speed-scaling setting in a simplified 
form in time-space. We simply compute all states without minimizing over energy. We omit a 
repetition of the DP and the analysis. 

We note that the above algorithm computes a permutation of jobs and thus a resumable 
scheduling solution. In a non-preemptive model the guessing step for split jobs is not neces- 
sary since there are no split jobs. In that case, we run the FPTAS above at a running time 
independently of the speed ratios. 

6 Preemptive scheduling with release dates 

A simple list scheduling approach allows to apply our previous results in the energy setting with 
non-trivial release dates. 



Algorithm List-Scheduling: 

1. Compute a solution sequence tt for the relaxed input instance with all release dates set 
to 0. 

2. Apply preemptive list scheduling according to tt, that is, run at any time t the job j with 
the highest priority in tt among the available (rj < t) but uncompleted jobs. 



Theorem 28. Algorithm List-Scheduling can be implemented as a factor (2 + e) approximation 
for continuous and discrete speed-scaling when jobs have individual release dates. 

Proof. In the first step we run the PTAS from Theorem 1 191 in the continuous-speed setting and 
from Theorem [201 in the discrete-speed setting, respectively. We keep the energy distribution 
over jobs and thus their speeds Sj as computed. Then we only need to argue on the increase of 
scheduling cost by preemptive list scheduling respecting release dates in the second step. The 
completion time of a job j is Cj = rj + ^k/sk. Now, the weighted sum over all jobs is 

bounded by the lower bound J2j wjrr and (1 + e) times the lower bound obtained by solving 
the relaxed problem without release dates. This concludes the proof. □ 

^Resumable jobs may run during a speed-0 interval; non-preemptive jobs must not. 
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